1,221 research outputs found

    The Stellatron Accelerator

    Get PDF

    A General Framework for Accelerator Management Based on ISA Extension

    Get PDF
    Thanks to the promised improvements in performance and energy efficiency, hardware accelerators are taking momentum in many computing contexts, both in terms of variety and relative weight in the silicon area of many chips. Commonly, the way an application interacts with these hardware modules has many accelerator-specific traits and requires ad-hoc drivers that usually rely on potentially expensive system calls to manage accelerator resources and access orchestration. As a consequence, driver-based interfacing is far from uniform and can expose high latency, limiting the set of tasks suitable for acceleration. In this paper, we propose a uniform and low-latency interface based on Instruction Set Architecture (ISA) extension. All the previous studies that proposed extensions, were deeply tailored to address a single accelerator. One of the biggest disadvantages of those methods is their inability to scale. Adding more of these accelerators to one System-on-Chip (SoC) would result in ISA bloat, increasing power consumption and complexifying the decoding phase proportionally. Our proposed framework consists of a six-instruction ISA extension and the corresponding architectural support that implements the interface abstraction and the reservation logic at the hardware level. Our proposal allows controlling a broad class of integrated accelerators directly from the CPU. The proposed framework is ISA-independent, which means that it is applicable to all the existing ISAs. We implement it on the gem5 simulator by extending the RISC-V ISA. We evaluate it by simulating three compute-intensive accelerators and comparing our interfacing with a conventional driver-based one. The benchmarks highlight the performance benefits brought by our framework, with up to 10.38x speed up, as well as the ability to seamlessly support different accelerators with the same interface. The speed up advantage of our technique diminishes as the granularity of the workloads increases and the overhead for driver-based accelerators becomes less important. We also show that the impact of its hardware components on chip area and power consumption is limited

    Intrinsic susceptibility and bond defects in the novel 2D frustrated antiferromagnet Ba2_{2}Sn2_{2}ZnCr7p_{7p}Ga107p_{10-7p}O22_{22}

    Get PDF
    We present microscopic and macroscopic magnetic properties of the highly frustrated antiferromagnet Ba2_{2}Sn2_{2}ZnCr7p_{7p}Ga107p_{10-7p}O22_{22}, respectively probed with NMR and SQUID experiments. The TT-variation of the intrinsic susceptibility of the Cr3+^{3+} frustrated kagom\'{e} bilayer, χkag\chi_{kag}, displays a maximum around 45 K. The dilution of the magnetic lattice has been studied in detail for 0.29p0.970.29 \leq p \leq0.97. Novel dilution independent defects, likely related with magnetic bond disorder, are evidenced and discussed. We compare our results to SrCr9p_{9p}Ga129p_{12-9p}O19_{19}. Both bond defects and spin vacancies do not affect the average susceptibility of the kagom\'{e} bilayers.Comment: Published in Phys. Rev. Lett. 92, 217202 (2004). Only minor changes as compared to previous version. 4 pages, 4 figure

    Study of the Boson Peak and Fragility of Bioprotectant Glass-Forming Mixtures by Neutron Scattering

    Get PDF
    The biological relevance of trehalose, glycerol, and their mixtures in several anhydrobiotic and cryobiotic organisms has recently promoted both experimental and simulation studies. In addition, these systems are employed in different industrial fields, such as pharmaceutical and cosmetic industries, as additives in mixtures for cryopreservation and in several formulations. This review article shows an overview of Inelastic Neutron Scattering (INS) data, collected at different temperature values by the OSIRIS time-of-flight spectrometer at the ISIS Facility (Rutherford Appleton Laboratory, Oxford, UK) and by the IN4 and IN6 spectrometers at the Institut Laue Langevin (ILL, Grenoble, France), on trehalose/glycerol mixtures as a function of the glycerol content. The data analysis allows determining the Boson peak behavior and discussing the findings in terms of fragility in relation to the bioprotective action of trehalose and glycerol

    The prevalence of cubital tunnel syndrome: A cross-sectional study in a U.S. metropolitan cohort

    Get PDF
    BACKGROUND: Although cubital tunnel syndrome is the second most common peripheral mononeuropathy (after carpal tunnel syndrome) encountered in clinical practice, its prevalence in the population is unknown. The objective of this study was to evaluate the prevalence of cubital tunnel syndrome in the general population. METHODS: We surveyed a cohort of adult residents of the St. Louis metropolitan area to assess for the severity and localization of hand symptoms using the Boston Carpal Tunnel Questionnaire Symptom Severity Scale (BCTQ-SSS) and the Katz hand diagram. We identified subjects who met our case definitions for cubital tunnel syndrome and carpal tunnel syndrome: self-reported hand symptoms associated with a BCTQ-SSS score of >2 and localization of symptoms to the ulnar nerve or median nerve distributions. RESULTS: Of 1,001 individuals who participated in the cross-sectional survey, 75% were women and 79% of the cohort was white; the mean age (and standard deviation) was 46 ± 15.7 years. Using a more sensitive case definition (lax criteria), we identified 59 subjects (5.9%) with cubital tunnel syndrome and 68 subjects (6.8%) with carpal tunnel syndrome. Using a more specific case definition (strict criteria), we identified 18 subjects (1.8%) with cubital tunnel syndrome and 27 subjects (2.7%) with carpal tunnel syndrome. CONCLUSIONS: The prevalence of cubital tunnel syndrome in the general population may be higher than that reported previously. When compared with previous estimates of disease burden, the active surveillance technique used in this study may account for the higher reported prevalence. This finding suggests that a proportion of symptomatic subjects may not self-identify and may not seek medical treatment. CLINICAL RELEVANCE: This baseline estimate of prevalence for cubital tunnel syndrome provides a valuable reference for future diagnostic and prognostic study research and for the development of clinical practice guidelines

    IXIAM: ISA EXtension for Integrated Accelerator Management

    Get PDF
    During the last few years, hardware accelerators have been gaining popularity thanks to their ability to achieve higher performance and efficiency than classic general-purpose solutions. They are fundamentally shaping the current generations of Systems-on-Chip (SoCs), which are becoming increasingly heterogeneous. However, despite their widespread use, a standard, general solution to manage them while providing speed and consistency has not yet been found. Common methodologies rely on OS mediation and a mix of user-space and kernel-space drivers, which can be inefficient, especially for fine-grained tasks. This paper addresses these sources of inefficiencies by proposing an ISA eXtension for Integrated Accelerator Management (IXIAM), a cost-effective HW-SW framework to control a wide variety of accelerators in a standard way, and directly from the cores. The proposed instructions include reservation, work offloading, data transfer, and synchronization. They can be wrapped in a high-level software API or even integrated into a compiler. IXIAM features also a user-space interrupt mechanism to signal events directly to the user process. We implement it as a RISC-V extension in the gem5 simulator and demonstrate detailed support for complex accelerators, as well as the ability to specify sequences of memory transfers and computations directly from the ISA and with significantly lower overhead than driver-based schemes. IXIAM provides a performance advantage that is more evident for small and medium workloads, reaching around 90x in the best case. This way, we enlarge the set of workloads that would benefit from hardware acceleration
    corecore